Similar classes latent distribution modelling-based oversampling method for imbalanced image classification
نویسندگان
چکیده
Learning an unbiased classifier from imbalanced image datasets is challenging since the may be strongly biased toward majority class. To address this issue, some generative model-based oversampling methods have been proposed. However, most of these pay little attention to boundary samples, which contribute tiny learning classifier. In paper, we focus on samples and propose a similar classes latent distribution modelling-based method. Specifically, first, model each class as different von Mises–Fisher distributions, thereby aligning feature with distributions. Furthermore, develop distance minimization loss function, makes representations close other. way, generator can capture more shared features during training. addition, sampling strategy, uses variables near decision generate samples. These expand minority region reshape boundary. Experiments four show that proposed method achieves promising performance in terms Recall, Precision, F1-score, G-mean.
منابع مشابه
Oversampling Method for Imbalanced Classification
Classification problem for imbalanced datasets is pervasive in a lot of data mining domains. Imbalanced classification has been a hot topic in the academic community. From data level to algorithm level, a lot of solutions have been proposed to tackle the problems resulted from imbalanced datasets. SMOTE is the most popular data-level method and a lot of derivations based on it are developed to ...
متن کاملAdaptive Oversampling for Imbalanced Data Classification
Data imbalance is known to significantly hinder the generalization performance of supervised learning algorithms. A common strategy to overcome this challenge is synthetic oversampling, where synthetic minority class examples are generated to balance the distribution between the examples of the majority and minority classes. We present a novel adaptive oversampling algorithm, VIRTUAL, that comb...
متن کاملSynthetic Protein Sequence Oversampling Method for Classification and Remote Homology Detection in Imbalanced Protein Data
Many classifiers are designed with the assumption of wellbalanced datasets. But in real problems, like protein classification and remote homology detection, when using binary classifiers like support vector machine (SVM) and kernel methods, we are facing imbalanced data in which we have a low number of protein sequences as positive data (minor class) compared with negative data (major class). A...
متن کاملOversampling for Imbalanced Learning Based on K-Means and SMOTE
Learning from class-imbalanced data continues to be a common and challenging problem in supervised learning as standard classification algorithms are designed to handle balanced class distributions. While different strategies exist to tackle this problem, methods which generate artificial data to achieve a balanced class distribution are more versatile than modifications to the classification a...
متن کاملWEMOTE - Word Embedding based Minority Oversampling Technique for Imbalanced Emotion and Sentiment Classification
Imbalanced training data always puzzles the supervised learning based emotion and sentiment classification. Several existing research showed that data sparseness and small disjuncts are the two major factors affecting the classification. Target to these two problems, this paper presents a word embedding based oversampling method. Firstly, a large-scale text corpus is used to train a continuous ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: The Journal of Supercomputing
سال: 2023
ISSN: ['0920-8542', '1573-0484']
DOI: https://doi.org/10.1007/s11227-022-05037-7